# ViT Fine-tuning

Vit Beans
Apache-2.0
A Vision Transformer model fine-tuned on the beans dataset based on google/vit-base-patch16-224-in21k for image classification tasks
Image Classification Transformers
V
SangjeHwang
17
1
My Trash Classification
An image classification solution implemented using Hugging Face's pre-trained Vision Transformer (ViT) model, capable of classifying images into six types of garbage
Image Classification Transformers
M
tribber93
259
1
Brand Identification
MIT
This model is a logo recognition model fine-tuned based on Google's Vision Transformer (ViT), specifically designed for classifying UAE company logos.
Image Classification Transformers English
B
Falconsai
478
1
Clip Vit Base Patch32 Stanford Cars
A visual classification model fine-tuned on the Stanford Cars dataset based on the CLIP Vision Transformer architecture
Image Classification Transformers
C
tanganke
4,143
1
Bhutanese Textile Model
Apache-2.0
A Bhutanese textile image classification model fine-tuned based on Google's ViT model
Image Classification Transformers
B
Dalaix703
50
1
Celebrity Classifier
Apache-2.0
A celebrity classification model based on Google Vision Transformer (ViT) architecture, designed to recognize 1000 top celebrities
Image Classification Transformers
C
tonyassi
394
5
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A Vision Transformer model fine-tuned on a flower image dataset based on Google's ViT-Base-Patch16-224 model
Image Classification Transformers
V
adamtky
15
0
Platzi Vit Model Mewita
Apache-2.0
An image classification model fine-tuned on a legume dataset based on Google's ViT model, achieving 97.74% accuracy
Image Classification Transformers
P
platzi
15
0
Histo Train
Apache-2.0
An image classification model fine-tuned based on google/vit-base-patch16-224, suitable for histology image analysis tasks.
Image Classification Transformers
H
tcvrishank
36
0
Fun
Apache-2.0
A vision model fine-tuned based on google/vit-base-patch16-224, suitable for image classification tasks
Image Classification Transformers
F
tcvrishank
16
0
Vit Base Letter
Apache-2.0
An image classification model fine-tuned on a letter recognition dataset based on Google's ViT base model, achieving 98.81% accuracy
Image Classification Transformers English
V
pittawat
93
2
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A Vision Transformer model fine-tuned on a flower image dataset, based on Google's ViT model
Image Classification Transformers
V
CHAOYUYD
35
0
Vit Base Beans
Apache-2.0
This model is a fine-tuned image classification model based on Google's ViT-base model on the beans dataset, achieving an accuracy of 98.5%.
Image Classification Transformers
V
leejw51
20
1
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A visual classification model fine-tuned on flower image datasets based on Google's Vision Transformer (ViT)
Image Classification Transformers
V
chanelcolgate
18
0
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A vision Transformer model fine-tuned on flower image datasets based on Google's ViT model
Image Classification Transformers
V
smakubi
35
1
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A Vision Transformer model fine-tuned on flower image datasets based on Google's ViT model, suitable for image classification tasks
Image Classification Transformers
V
Barghi
35
0
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A Vision Transformer model fine-tuned on a flower image dataset based on Google's ViT model, suitable for image classification tasks
Image Classification Transformers
V
RicardC
17
0
My Awesome Food Model
Apache-2.0
Food classification model fine-tuned on the food101 dataset based on Google's ViT model
Image Classification Transformers
M
jinkasreedhar
16
0
My Food Model
Apache-2.0
Food image classification model based on Google Vision Transformer (ViT) architecture, fine-tuned on the Food101 dataset with an accuracy of 90.9%
Image Classification Transformers
M
iammartian0
18
0
Fl Image Category Multi Label
Apache-2.0
This is an image classification model fine-tuned based on Google's ViT model, trained on the fl_image_category_ds dataset with an accuracy of 66.22%.
Image Classification Transformers
F
StephenSKelley
17
1
My Awesome Food Model
Apache-2.0
This is a food classification model fine-tuned on the food101 dataset based on Google's ViT model, achieving an accuracy of 89.5%.
Image Classification Transformers
M
luigg
17
0
Vit Base Patch16 224 Finetuned Algae Wirs
Apache-2.0
This model is a vision classification model fine-tuned on an algae dataset based on Google's ViT model, primarily used for algae image classification tasks.
Image Classification Transformers
V
samitizerxu
20
0
Vit Model Beimer
Apache-2.0
This model is a fine-tuned image classification model based on Google's ViT-base-patch16-224-in21k on the beans dataset, achieving an accuracy of 98.5%.
Image Classification Transformers
V
tadeous
39
0
Cristian Vit
Apache-2.0
This model is an image classification model fine-tuned on a bean dataset based on Google's ViT architecture, achieving 100% accuracy on the validation set.
Image Classification Transformers
C
agudelozc
40
0
Genderage2
Apache-2.0
Vision Transformer model based on ViT architecture for gender and age classification tasks
Image Classification Transformers
G
ivensamdh
263
3
Google Vit Base Patch16 224 Cartoon Face Recognition
Apache-2.0
A cartoon face recognition model fine-tuned based on the Google Vision Transformer (ViT) architecture, excelling in image classification tasks
Face-related Transformers
G
jayanta
62
2
My Awesome Food Model
Apache-2.0
Food image classification model based on ViT architecture, fine-tuned on the Food101 dataset with an accuracy of 89.7%
Image Classification Transformers
M
asd0936
38
0
Vit Base Beans
Apache-2.0
This model is an image classification model fine-tuned on the beans dataset based on Google's ViT architecture, achieving an accuracy of 97.74%.
Image Classification Transformers
V
naveensb8182
22
0
Vit Base Patch16 224 In21k GI Diagnosis
Apache-2.0
A gastrointestinal image classification model based on ViT architecture for diagnosing various conditions from colonoscopy images
Image Classification Transformers English
V
DunnBC22
22
1
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A Vision Transformer model fine-tuned on a flower image dataset based on Google's ViT model
Image Classification Transformers
V
jonathanfernandes
48
0
Vit Base Beans
Apache-2.0
An image classification model fine-tuned on a bean dataset based on Google's ViT model, achieving an accuracy of 97.74%
Image Classification Transformers
V
socokal
30
0
Dataset Model
Apache-2.0
An image classification model based on ViT architecture, fine-tuned on an image folder dataset
Image Classification Transformers
D
Farideh
30
0
Vit Model
Apache-2.0
This model is an image classification model fine-tuned on the beans dataset based on Google's ViT architecture, designed to identify the health status of legume plants.
Image Classification Transformers
V
jeraldflowers
16
0
Platzi Vit Model Tommasory Beans
Apache-2.0
An image classification model fine-tuned on a bean dataset based on Google's ViT model, achieving 99.25% accuracy
Image Classification Transformers
P
tommasory
30
0
Vit Model Santiago Ahumada
Apache-2.0
This model is a fine-tuned image classification model based on google/vit-base-patch16-224-in21k on a bean dataset, achieving 100% accuracy on the evaluation set.
Image Classification Transformers
V
santiagoahl
31
0
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A vision classification model fine-tuned on flower image datasets based on Google's ViT model
Image Classification Transformers
V
jafdxc
30
0
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
A vision Transformer model fine-tuned on flower image datasets based on Google's ViT model
Image Classification Transformers
V
jon-fernandes
14
0
Vit Base Patch16 224 In21k Writer Identification
Apache-2.0
Fine-tuned based on Google's ViT model for handwriting recognition tasks
Image Classification Transformers
V
Imene
21
0
Vit Base Beans
Apache-2.0
This model is based on Google's ViT architecture, fine-tuned on the beans dataset, achieving an accuracy of 98.5%.
Image Classification Transformers
V
liangy2
14
0
UCF Crime
Apache-2.0
This model is a fine-tuned version of google/vit-base-patch16-224 on the imagefolder dataset, suitable for vision tasks.
Image Classification Transformers
U
csr2000
46
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase